S 15 a . 13 A STUDY OF LSF REPRESENTATION FOR SPEAKER - DEPENDENT AND SPEAKER - INDEPENDENT HMM - BASED SPEECH RECOGNITION SYSTEMS
نویسنده
چکیده
In this paper, the line spectral-pair frequency (LSF) representation is used as the parametric representation for speech recognition. Its performance is compared with that of the cepstral cc-efficient (CC) representation for the speaker-dependent and speaker-independent hidden Markov model (HMM) based isolated word recognition systems. It is shown that the CC and the LSF representations result in comparable recognition performances for the full covariance matrix case. But, for the diagonal covariance matrix case, the LSF r e p resentation provides significantly better recognition performance than the CC representation.
منابع مشابه
On the use of line spectral frequency parameters for speech recognition
The line spectral frequency (LSF) representation has been proposed by Itakura [l] as an alternative linear prediction (LP) parametric representation. In the context of speech coding, it has been shown [2-61 that this representation has better quantization properties than the other LP parametric representations (such as log area ratios and reflection coefficients). The LSF representation is capa...
متن کاملRobust distant speaker recognition based on position-dependent CMN by combining speaker-specific GMM with speaker-adapted HMM
In this paper, we propose a robust speaker recognition method based on position-dependent Cepstral Mean Normalization (CMN) to compensate for the channel distortion depending on the speaker position. In the training stage, the system measures the transmission characteristics according to the speaker positions from some grid points to the microphone in the room and estimates the compensation par...
متن کاملHMM adaptation for child speech synthesis
Hidden Markov Model (HMM)-based synthesis in combination with speaker adaptation has proven to be an approach that is well-suited for child speech synthesis [1]. This paper describes the development and evaluation of different HMM-based child speech synthesis systems. The aim is to determine the most suitable combination of initial model and speaker adaptation techniques to synthesize child spe...
متن کاملSpeaker-dependent Speech Recognition Based on Phone-like Units Models | Application to Voice Dialing
This paper presents a speaker dependent speech recognition with application to voice dialing. This work has been developed under the constraints imposed by voice dialing applications, i.e., low memory requirements and limited training material. Two methods for producing speaker dependent word baseforms based on Phone Like Units (PLU) are presented and compared : (1) a classical vector quantizer...
متن کاملSpeaker Adaptation Using Multiple Reference Speakers
We introduce a new technique for using the speech of multiple reference speakers as a basis for speaker adaptation in large vocabulary continuous speech recognition. In contrast to other methods that use a pooled reference model, this technique normalizes the training speech from multiple reference speakers to a single common feature space before pooling it. The normalized and pooled speech can...
متن کامل